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We consider the set of finite random words A*, with independent letters drawn 
from a finite or infinite totally ordered alphabet according to a general probability 
distribution. On a specific subset of A*, considering certain factorization of the 
words which are labelled with the ranks, base on the lexicographical order, we 
prove that the normalized position of the ranks of factors, are uniform, when the 
length of the word goes to infinity. 

1. Introduction 

We consider a general probability distribution {pi)i>\ (pi > 0) on a set A = 
{ai < <22 < . . . } of letters, and we assume, without loss of generality, that < pi < 
1. On the corresponding set of words, A* = Un>o-^« the se ^ °f wor ds with 
length n), considering the word w £ A n that can be expressed as w = a^a^ . . . ag n 
(liS are the positive integer numbers), we define the weight p(w) as 

p(w) =Pi ± pi 2 ■■■Pi n - 

With the weight p{.), comes a probability measure on A n , V n ({w}) = p(w), and 
thus a probability measure on any subset of A n . Letting $ n denotes the cr-algebra 
generated by A n , the triple of (A n , 3n , P n ) is the corresponding probability space. 

We recall here some general definitions from [3] (readers can see also [SHU [5]). 
A word v is a factor of a word w if there exists two other words s and t, possibly 
empty, such that w — svt. If s is empty v is a prefix (or a right factor) of w and if 
t is empty v is a suffix (or a left factor) of w. 

A lexicographic order on the set of words A* is given by a total order on the 
alphabet A — {a±, a 2 , . . . } extended to the words in the following way: A word u, 
is said to be smaller than a word v if u is a prefix of v or u — ra^s and v = rajt 
such that i < j and r, s and t be some words, possibly empty. 

For w = Wi . . . w n , a word in A n {n > 0), we define tw — w-i . . . w n w\. Then 
(t) = {Id, t, . . . , r"™ 1 } is the group of cyclic permutations of the letters of a word 
with length n. The orbit (w) of a word w under (r) is called a necklace. 

A word w G An {n > 0) is called primitive if its necklace (w) has exactly n 
elements. In other words, a word w G A n is primitive if it is not a power of another 
word in A* (remark that a word w is a power of another word u, if w can be written 
as w = uuu . . . u) . Denote by V n the set of primitive words in A n and by Af n its 
complement. 

In this article we work on a subset of V n (n > 0) containing the primitive 
words which begin with a run of the own smallest letter and end with a run of 
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a letter different from the smallest. We denote this subset by W n - For example 
the word 0203040203 is a word in W5. The set of W n contains the words with 
certain properties that can be interesting in some applications of combinatorial on 
words. The Lyndon words with length n (the words which are strictly smaller than 
any their proper suffix), for example, are included in W n . We consider then the 
probability measure W„ on W n (the conditioning probability in the probability 
space of (An 5n j Pn)): 

"-«-»-» 

Now, we divide the words of W n to the factors that we call the blocks of word, 
in the following way: 

Definition 1.1. Let w be a word in W n and a w the smallest letter of w (remark 
that w begins with a w ). The blocks of w are the factors of w that begin with a run 
of "a w " and end just before very next run of "a w ". 

Thus, the blocks of a word are the factors in the form a w k °a^a^ . . . a\ m such 
that ai i ^ a w and fco > 0. 

Now, concerning the lexicographical order, each block of a word w in W n can be 
ranked, according to the order of the word in the necklace of (w) which begins by 
the mentioned block. For example, in the word w — aca 2 baedba 2 ba 3 d 2 b, the blocks 
and their related ranks below them, are: 

ac, a 2 b, aedb, a 2 b, a 3 d 2 b, 

4 3 5 2 1 

as there is the following order between the five words of the necklace (w) , which 
begin by the blocks of w. 

3 2 2 2 2322 2 232 

a d baca baedba b < a ba d baca bacdb < a baedba ba d bac 

< aca 2 baedba 2 bo? d 2 b < acdba 2 bo?d 2 baca 2 b. 

In this work, we are interested in the limiting distribution of the positions of 
ranks of a random word of W„ , which seems to be uniform. It is trivial that if the 
uniform random permutation of the blocks, causes the uniform displacements of the 
ranks, the purpose would be entailed. But in general, it is not true because of the 
existence of the equal blocks. In fact, it is well possible that certain permutations 
of ranks are not produced. For example, for the word w = a 2 b, a 2 b, ab, ab with the 
respective ranks 1, 2, 4, 3 of its blocks, the permutation 1, 2, 3, 4 of the ranks is not 
produced by any permutation of the blocks of w, no more any permutation of the 
cyclic permutation of 1, 2, 3, 4 as 4, 1, 2, 3. Moreover, certain permutations of the 
blocks of a word, like the permutation a 2 b, ab, a 2 b, ab of the blocks of w, produce 
the non-primitive words for which the ranks of the blocks are not defined. Never- 
theless, by definition of the ranks, it is trivial that the cyclic permutations of the 
blocks of a word causes the cyclic permutations of the related ranks. On the other 
hand, in any orbit of the cyclic permutation of the ranks, the ranks are uniformly 
distributed on all positions; the fact that is the key of the prove. 

Marchand & Zohoorian in [6l Section 6], for a random word of {a, b} n when n is 
sufficiently large, demonstrate hardly that the uniform permutation of the blocks 
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of the word, barring the block with rank 1, entails the uniform displacement of 
the rank 2 on all possible positions. However, our result may be applied also in 
the analyze of the height of the labelled binary trees emerged by the successive 
iterations of the standard factorization of the Lyndon words (an introduction on 
Lyndon trees is given by Marchand & Zohoorian in [6, Section 1] or by Bassino 
et al. in [I]). In fact, the height of Lyndon trees has a direct relation with the 
positions of the ranks in the root word. The study on the structure of Lyndon trees 
is a work in progress. The main theorem that we will prove in this article is: 

Theorem 1.1. The positions of ranks in a random word o/W„ ; divided by Elil^EliH^ 
converge in law, when n goes to infinity, to 

H(dx) = l [0jl] (x)dx, 

where dx denotes the Lebesgue measure on R. 

2. Number of the blocks of a word 

To begin, we concentrate on the number of the blocks of a random word in W n . 
It is evident that the number of the blocks of a word in W„ is equal to the number 
of the runs of its smallest letter. Now, we define the following function on V n that 
carries the words of V n to its subset W n : 

Definition 2.1. Let <f> denotes the function on V n that brings any word w of V n 
to itself if w is a word in W„, and otherwise to the word of (w) which begins by the 
last run of the smallest letter of w (see the following example). 



Example. 

• If w = aba 2 cb 2 a 2 , then 4>{ w ) — a?ba 2 cb 2 . 

• If A = {aba 2 c, acbab} C W5 then (j)~ 1 (A) = {aba 2 c, ba 2 ca}L){acbab, cbaba, babac}. 

We remark that </> is a surjective map on V n to W n and the inverse image of w £ W n 
is a subset of "P„ whose cardinality is equal to the length of the first block of w. 

The next lemma allows to transfer results from random words of A n to random 
words of W n - For this mean, we set 




for a > 1. 

Notation. If a and b are two functions from N into K, a{n) = O (b(n)) 3c > 
0, Vn € N, \a(n)\ < c\b{n)\. 

Lemma 2.1. For A C W n , we have: 

\W n {A)-F n {r\A))\=0{\\p\\l). 

Note that \\p\\ 1 = 1, and that, under the assumption {0 < p\ < 1}, \\p\\ a is 
strictly decreasing in a. Other well known inequalities include ||p|| 2 < ^/maxp^. 
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The lines of the proof of this lemma are exactly the same as in the proof of 
Lemma 2.1 of [2], but here we work on the set W n , instead of the set of Lyndon 
words, there. So we remove the proof, referring the reader to [5] Lemma 2.1]. 

Let now for any word w in A n we denote by N® 1 (w) the number of runs of the 
letter a± in the word w. We have then the following lemma: 

Lemma 2.2. (Number of runs of the letter ai). 

We remove the proof of this lemma, again inviting the reader to see [3] Lemma 
2.3]. 

By Lemma 12. 2[ one sees that the number of the blocks of a word with length 
n, when n is sufficiently large, is of the order n with a high probability as a word 
with length n has at most ^ blocks (the case where the smallest letter repeats 
alternatively). 

3. Displacements of ranks 

We consider, at first, some notations and definitions. As a consequence of Defi- 
nition [TTT1 any word w € W n can be decomposed uniquely as 

w = B 1 (w),B 2 (w), . . . ,B Nn{w) (w), 

in which Bi (w) 's stand for the blocks of w and N n (w) denotes the number of the 
blocks. We denote the respective ranks of the blocks of w by 

r(w) = r 1 (w),r 2 (w), . . . ,r Nn{w) (w), 

called, briefly, the rank of w. Obviously, r(w) is a permutation of 1, 2, ... , N n (w). 

Definition 3.1. For any w — Bi,B 2 , . . . , B^ € W n in which Bi 's are the blocks 
of w, we define (3w — B 2 , ■ ■ ■ , B^, B\. Then ((3) — {Id, (3, . . . , /S^ -1 } is the group 
of cyclic permutations of the blocks of a word in W n . We call the orbit (w) p of the 
blocks of w under {(3), the block orbit of w. 

Proposition 3.1. The ranks of the blocks of a word in W ra are invariant under the 
cyclic permutation of the blocks. 

Proof. By definition of the ranks, it is evident that the ranks are permuted cyclically 
as the blocks are permuted. □ 

The following corollary is an immediate result of the above proposition and the 
proof is left as it is evident. 

Corollary 3.1. In any block orbit, the ranks are distributed uniformly on all posi- 
tions. In other words, if the number of the words in a block orbit is N , the probability 
that the i-th rank, i = 1, . . . , N, be in the position k, k = 1, . . . , N, is equal to . 

Let & n denote the set of permutations of {1, . . . ,n}. For w € W„ , and a £ 

6jv„(w)i w e set 

a.w = B a(1) {w), . . . , B a(Nn{w)) (w). 
Conditioning then a.w € Wn, the rank of a.w, r(a.w), is also a permutation 
in ©jv„(iu)- We set C(w) — {a.w : a G <&n„(w) & G W n } and C n = 
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{C(w) : w G Wn}- Let € n denote the cr-algebra generated by C n . In the following 
proposition we see that W n is parted to C(w)s: 

Proposition 3.2. C n is a partition ofW n . 

Proof. Assume that w G W n , and w' G C(w): then w' G W n and w' has the same 
multiset of blocks as w (it has the same blocks, with the same multiplicity). As a 
consequence, for w,w' G W„, either C(w) = C(w') or C(w) n C(w') =0. □ 

In the following proposition, which is the key proposition of this result, we see 
that the block orbits divide C(w): 

Proposition 3.3. For any w G W„, C(w) is parted to the block orbits. 

Proof. As C(w) contains the primitive words produced by permutations of the 
blocks of w, it is sufficient to verify that all cyclic permutations of the blocks of 
a word in W n produce the primitive words. Suppose now that a cyclic permu- 
tation of the blocks of a word v = B\, B^ 1 ■ ■ ■ , Bn E W„, for example (3 l v — 
Bi + \, Bi + 2, ■ ■ ■ , Bn, Bi, . . . , Bi (1 < i < N), is a non-primitive word in the form u r 
for u E A* and 2 < r < n. As u is a prefix and a suffix of f3 l v and /3*w is a word in 
W n , u is a word which begins by its smallest letter and end with a letter different 
from the smallest letter. Therefore u can be factorized in the blocks and so has 
exactly the same blocks as the first blocks of f3 l v. Consequently, the blocks of j3 l v 
are periodically equals. That is, if u has j number of the blocks, they are equals 
to the first j, second j, ... and last j blocks of f3 z v. But this equalities, in f3 l v, 
entail that for any k = 1, 2, . . . , N — j, Bk — Bk+j and that all cyclic permutations 
of the blocks of f3 l v, especially v, will be the words which can be part to y equal 
factors. Therefore, any cyclic permutation of the blocks of a word in Wn, can not 
be a non-primitive word. □ 

4. Proof of Theorem 11.11 

Let B(w) — (Bi(w)) i>Q be the sequence of blocks of w, ended by an infinite 
sequence of empty words, and let f(w) = {ri(w)) i>Q be the corresponding sequence 
of ranks. 

Lemma 4.1. The weight p(.) , B, f and N n are £ n -measurable, and 



E 



Card(C) p(C) ^ 



Given that w € C , the positions of the ranks of w, (rj(w))i<i<7v ri ( ( 7), are distributed 
uniformly on {1, 2, . . . , A„(C)}. 

Proof. The weight p(w) depends only on the number of letters a±, a 2 , . . .that w 
contains, not on the order of the letters in w, so that is constant on each 
C G C n : thus, under W„, the conditional distribution of w given that w G C is 
Vc- As a consequence of Proposition 13.21 the relation in Lemma [4.11 is just the 
disintegration of W„ according to its conditional distributions given £„. Finally, 
Uco) is the image of the uniform probability on [&N n (w) x { w }] Pi W„. Thus, by 
Propositions 13. 1[ 13.31 and Corollary 13. 1[ under Uc(w), the positions of the ranks 
are distributed uniformly on {1, 2, . . . , N n (C)}. It follows that, under W n , the 
conditional distribution of the positions of the ranks given C(w), or given N n , is 
uniform too. □ 
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We can see, by Lemma l2~2"1 the probability that the number of blocks of a random 
word of W n , N n , be of order n, increases when n increases. Now, let the position 
of the i-th rank, denoted by I n , is distributed uniformly on {1, 2, . . . , N n (C)}. 
We put I n = Pl (\- Pl ) n Ini the normalized position of the i-th rank by Pl ^~ Pl ^ n. 
We shall see that I n is approximately uniform, for its distribution is close to the 
uniform distribution on [0, 1], that we note U in the rest of paper. This proximity is 
understood with respect to the £ 2 - Wasserstein metric W2O, •)■ The /^-Wasserstein 
metric W 2 (., .) is defined by 

(2) U .(//./'.; = inf 



£(X)=M 
C{Y)=y 



\X-Y\& 



in which /i and v are probability distributions on M. d , and || .|| 2 denotes the Euclidean 
norm on Mr. In this paper, we consider essentially the case d = 1. We mention that 
convergence of C{X n ) to C(X) with respect to W 2 (., .) entails convergence of X n 
to X in distribution, and we refer to [7] for an extensive treatment of Wasserstein 
metrics. In what follows, we shall improperly refer to the convergence of X n to X 
with respect to W 2 (., .), meaning the convergence of their distributions. The main 
reason for the asymptotic uniformity of X n is a form of convergence of the empirical 
distribution function G n (t) to t, in the notations of O Ch. 3.1, p. 85, display (3)]: 

Lemma 4.2. Consider a partition of [0, 1) into n intervals l 1 -^, ~) ; i = 1, • • • , n. 

For r a random cyclic permutation of the class of all cyclic permutations of the 
intervals, we have 

W 2 (t(*),U) < 

This Lemma is also a specific case of [6j Lemma 6.3] where one can see for a proof. 

Theorem II .11 is demonstrated when the following proposition is proved: 
Proposition 4.1. Let v n be the distribution ofI n under (W n ,W n ). We have then: 

w 2 (v n ,U) = o(vTaT) . 

As a consequence, under W n , the moments of I n converge to the corresponding 
moments of U. 

Proof. With the notations of Lemma [4.11 for C £ C n , let vc denote the image of 
Uc by I n , so that 

_ v Card(C) p(0 

Consider the blocks b\ < ■ ■ ■ < bpj n of a word w G W n , sorted in increasing 
lexicographic order. There exists at least one permutation t 6 6jy„ such that 
to = & T (i) ■■■b T (N n )- Let UIn„ denote the set of permutations 5 such that oj = 
5~ .t~^ .w be an element of C{w). Then, for a is a random uniform element of £K_/v„, 
oj = c ^ .t~ .w is a random uniform element of C(w). Set T(cr) = I n (a~ 1 .t^ 1 .w). 
Then, the distribution of T is vc(w) by Corollary 13.11 and Proposition 13.31 Thus, 
by a straightforward extension of Lemmas 14.21 and 12. 2\ 

1/2 / o 



(4) W 2 (v c{w) ,U) <E 



< 



Pi(l —pi)n 
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Finally, joining ((3]) and ((4]) we obtain 

and since < T n < 1, convergence of moments follows. □ 
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